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Abstract. We consider ideals and Boolean combinations of ideals. For the 
regular languages within these classes we give expressively complete automaton 
models. In addition, we consider general properties of regular ideals and their 
Boolean combinations. These properties include effective algebraic characteriza- 
tions and lattice identities. 

In the main part of this paper we consider the following deterministic one-way 
automaton models: unions of flip automata, weak automata, and Staiger- Wagner 
automata. We show that each of these models is expressively complete for regu- 
lar Boolean combination of right ideals. Right ideals over finite words resemble 
the open sets in the Cantor topology over infinite words. An omega-regular lan- 
guage is a Boolean combination of open sets if and only if it is recognizable by a 
deterministic Staiger- Wagner automaton; and our result can be seen as a finitary 
version of this classical theorem. In addition, we also consider the canonical au- 
tomaton models for right ideals, prefix-closed languages, and factorial languages. 

In the last section, we consider a two-way automaton model which is known 
to be expressively complete for two-variable first-order logic. We show that the 
above concepts can be adapted to these two-way automata such that the result- 
ing languages are the right ideals (resp. prefix-closed languages, resp. Boolean 
combinations of right ideals) definable in two- variable first-order logic. 

1 Introduction 

The Cantor topology over infinite words is an important concept for classifying languages 
over infinite words. For example, an w-regular language is deterministic if and only if it is 
a countable intersection of open sets, cf. [181 Remark 5.1]. There are many other properties 
of w-languages which can be described using the Cantor topology, see e.g. [TH[Tn]. Ideals 
are the finitary version of open sets in the Cantor topology. A subset P of a monoid M 
is a right (resp. left, two-sided) ideal if PM C P (resp. MP C P, MPM C P). In 
particular, a language L C A* is a right ideal if LA* C L. A filter is the complement of 
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an ideal. Thus over finite words, a language L C A* is a. right filter if and only if it is 
prefix-closed, i.e., if uv € L implies u G L. Prefix-closed languages correspond to closed 
sets in the Cantor topology. A language i C A* is a two-sided filter if and only if it is 
factorial (also known as factor-closed or infix-closed), i.e., if uvw G L implies v G L. Our 
first series of results gives effective algebraic characterizations of right (resp. left, two-sided) 
ideal languages and of Boolean combinations of such languages. In addition, we give lattice 
identities for each of the resulting language classes. As a byproduct, we show that a language 
is both regular and a Boolean combination of right (resp. left, two-sided) ideals if and only 
if it is a Boolean combination of regular right (resp. left, two-sided) ideals, i.e., if I is the 
class of right (resp. left, two-sided) ideals and REG is the class of regular languages, then 
REG n BI = B(REG n I). Here, B denotes the Boolean closure. 

The second contribution of this paper consists of expressively complete (one-way) au- 
tomaton models for right ideals, for prefix-closed languages, for factorial languages, and for 
Boolean combinations of right ideals. The results concerning ideals and closed languages are 
straightforward and stated here only to draw a more complete picture. Our main original 
contribution are automaton models for regular Boolean combinations of right ideals. We 
always assume that every state in an automaton is reachable from some initial state, i.e., 
all automata in this paper are accessible. 

• A flip automaton is an automaton with no transitions from final states to non-final 
states, i.e., it "flips" at most once from a non-final to a final state. Consequently, every 
minimal complete flip automaton has at most one final state which has a self-loop 
for each letter of the alphabet. Paz and Peleg have shown that if a language L is 
recognized by a complete deterministic automaton A, then L is a right ideal if and 
only if ^ is a fiip automaton [TT]. A language is a regular Boolean combination of 
right ideals if and only if it is recognized by a union of fiip automata (which do not 
have to be complete). 

• An automaton is fully accepting if all states are final. A word u is rejected in a fully 
accepting automaton A if and only if there is no w-labeled path in A which starts in 
an initial state. Nondeterministic fully accepting automata are expressively complete 
for prefix-closed languages. Moreover, if a language L is recognized by a deterministic 
trim automaton A, then L is prefix-closed if and only if A is fully accepting. 

• A path automaton is an automaton A such that all states are both initial and final, i.e., 
a word u is accepted by A if there exists a u-labeled path in A. Both deterministic 
and nondeterministic path automata recognize exactly the class of regular factorial 
languages. This characterization can be implicitly found in the work of Avgustinovich 
and Frid [I]- 

• An automaton is weak if in each strongly connected component either all states are 
final or all states are non-final. Any run of a weak automaton fiips only a bounded 
number of times between final and non-final states. Nondeterministic weak automata 
can recognize all regular languages. On the other hand, if a language L is recognized 
by a deterministic automaton A, then L is a Boolean combination of right ideals if 
and only if A is weak. Weak automata have been introduced by Muller, Saoudi, and 

schupp ng. 

• Deterministic Staiger- Wagner automata over infinite words have been used for charac- 
terizing w-languages L C A'^ such that both L and A'^ \ L are deterministic [TB] . Accep- 
tance of a run in a Staiger- Wagner automaton only depends on the set of states visited 
by the run (but not on their order or their number of occurrences). We show that. 
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over finite words, deterministic Staiger- Wagner automata are expressively complete 
for Boolean combinations of right ideals. In particular, deterministic Staiger- Wagner 
automata and deterministic weak automata accept the same class of languages. 

We note that flip automata, fully accepting automata, and weak automata yield effective 
characterizations of the respective language classes. For example, in order to check whether 
a deterministic automaton A recognizes a Boolean combination of right ideals, it suffices 
to test if A is weak. Moreover, the above automaton models can easily be applied to 
subclasses of automata such as counter-free automata [S]. This immediately yields results 
of the following kind: A regular language L is both star-free and a Boolean combination of 
right ideals if and only if its minimal automaton is weak and counter-free. 

For some classes of languages it is more adequate to use two-way automata. The rela- 
tion between two-way automata and ideals (resp. closed languages. Boolean combinations 
of ideals) is more complex than for one-way automata. In the last section, we consider 
deterministic partially ordered two-way automata (po2dfa) . Partially ordered automata are 
also known as very weak, 1-weak, or linear automata. We give restrictions of po2dfa's which 
define the right ideals (resp. prefix-closed languages. Boolean combinations of right ide- 
als) inside the po2dfa-recognizable languages. The class of languages recognized by po2dfa 
has a huge number of equivalent characterizations; these include the variety DA of finite 
monoids, two- variable first-order logic, unary temporal logic, unambiguous polynomials, and 
rankers; see e.g. [T71I1]. Some of these characterizations admit natural restrictions which 
are expressively complete for their ideal (resp. prefix-closed. Boolean combination of ideals) 
counterparts. We introduce one-pass flip po2dfa (resp. one-pass fully accepting po2dfa, one- 
pass po2dfa) as expressively complete automaton models for right ideals (resp. prefix-closed 
languages. Boolean combinations of right ideals) inside the class of po2dfa-recognizable lan- 
guages. For definitions of these automaton models, we refer the reader to Section [SJ The 
main challenge for each of the above automaton models is showing closure under union and 
intersection since standard techniques, such as sequentially executing one automaton after 
the other, cannot be applied. As a complementary result we see that weak one-pass two-way 
dfa's have the same expressive power as their one-way counterparts, i.e., recognize regular 
Boolean combinations of right ideals. 

2 Preliminaries 

Throughout this paper, A is a finite alphabet. The set of finite words over the alphabet A 
is denoted by A*; it is the free monoid over A. The neutral element is the empty word e. 
The set of nonempty words is yl+ = ^* \ {e}. If a language L C A* satisfies LA* C L 
(resp. A*L C L, A*LA* C L), then L is a right ideal (resp. left ideal, two-sided ideal). If 
L — A* \ K for some right (resp. left, two-sided) ideal K, then L is prefix-closed (resp. 
suffix-closed, factorial). Factorial languages are also known as factor-closed or infix-closed. 
Boolean combinations consist of complementation, finite unions, and finite intersections. 

Green's relations on a monoid M are defined as follows. For x,y G M let x <ti y (resp. 
X <£ y, X <j y) if there exist s,t d M such that x — ys (resp. x — ty, x — tys). We set 
X TZ y ii both x <ii y and y <tz x. The relations C and J are defined similarly involving <£ 
and <j, respectively. An element x G M is idempotent if a; = a;^. In every finite monoid M 
there exists a number a; > 1 such that x'^ is idempotent for all x £ AI . A homomorphism 
h : A* ^ M recognizes a language L <Z A* ii L = h~^{P) for some P C M , i.e., u G L\i and 
only if h{u) £ P. A monoid M recognizes L if there exists a homomorphism h : A* M 
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recognizing L. For every regular language L there exists a unique minimal finite monoid 
Synt(L) which recognizes L (and which is effectively computable as the transition monoid 
of the minimal automaton). It is the syntactic monoid of L, and it is naturally equipped 
with a recognizing homomorphism : A* ^ Synt(L), called the syntactic homomorphism. 
A language is regular if and only if its syntactic monoid is finite, see e.g. |12| . 

Lattice identities are a tool for describing classes of languages (these language classes form 
so-called lattices). Lattice identities can be defined in the general setting of free profinite 
monoids [S^. In this paper, we only introduce the w-notation. We inductively define w-terms 
over a set of variables S: Every x G S is an w-term; and if x and y are w-terms, then 
so are xy and (a;)". For a number G N and an w-term a;, we define x{n) inductively 
by (xy){n) — x{n)y(n), (a;")(n) = x(n)"', and x{n) — x for x G E, i.e., x{n) is the word 
obtained by replacing all exponents cj in a; by n!. Intuitively, a;" is the idempotent element 
generated by x with respect to all regular languages. A regular language L satisfies the 
lattice identity x — ?• y for w-terms x and y if there exists no G N such that for all n > uq and 
for all homomorphisms h : T,* ^ A* the implication /i(x(n)) G L =^> h(y{n)^ G L holds. It 
satisfies x y if x ^ y and y ^ x. 

3 Ideals and Their Boolean Combinations 

Many interesting properties over finite words can be stated as follows: There exists a prefix 
(resp. suffix, factor) which has some desirable property L C A* and we do not care about 
subsequent actions. This immediately leads to the right ideal LA* (resp. left ideal A*L, two- 
sided ideal A* LA*). Such languages and their Boolean combinations arise naturally, see 
e.g. [21 [7]. We give effective algebraic characterizations and lattice identities for the regular 
ideal languages (Proposition [T]) and the regular Boolean combinations of ideals (Theorem [2]). 
In the case of ideals, the proof is straightforward and relies on the following simple fact. If 
h : M N is & surjective homomorphism between monoids and J C M as well as J C N 
are right ideals (resp. left ideals, two-sided ideals), then h{I) and h^^{J) are also right ideals 
(resp. left ideals, two-sided ideals), i.e., ideals are closed under homomorphic and inverse 
homomorphic images. 

Proposition 1 Let L C_ A* be a regular language recognized by a surjective homomorphism 
h : A* AI onto a monoid M . The following are equivalent: 

L L is a right ideal (resp. left ideal, two-sided ideal). 

2. h{L) is a right ideal (resp. left ideal, two-sided ideal). 

3. L satisfies the lattice identity y yz (resp. y — )■ xy, y — xyz). 

Proof. Right ideals (resp. left ideals, two-sided ideals) are closed under surjective homo- 
morphisms and under inverse homomorphisms. Thus ([ij and (0) are equivalent. We have 
LA* C L if and only if for all y, z G A* we have y £ L ^ yz ^ L if and only if L satisfies 
the lattice identity y yz. This establishes the equivalence of ((IJ and ([3]) for right ideals; 
the argument for left ideals and two-sided ideals is analogous. □ 

In particular, property Q of Proposition [T] yields decidability of whether a given regular 
language is a (right, left, or two-sided) ideal of A* because the syntactic homomorphism 
hL '■ A* Synt(L) and the set hL{L) are effectively computable. Moreover, regular (right, 
left, and two-sided) ideals are closed under union, intersection, and inverse homomorphisms. 
They do not form so-called positive varieties because they are not closed under residuals 
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(even though right ideals are closed under left residuals, and left ideals are closed under right 
residuals), cf. [T2]. An easy example is L = abA* over the alphabet A — {a,b}; we have 
a G Lb^^ — LU {a} and aa ^ Lb~^, showing that Lb~^ is not a right ideal. 

In the next theorem, we consider Boolean combinations of ideals. Note that \ih : M N 
is a surjective homomorphism and /, J are ideals of M . then in general, we have h{I\J) ^ 
h{I) \ h{J). Another obstacle for Boolean combinations of ideals is the following: If L is 
regular and a Boolean combination of ideals Ki, then the Ki need not be regular. As a 
byproduct of our characterization in Theorem [51 we see that in the above situation, one can 
find regular ideals K'^ such that L is a Boolean combination of the languages K[. 

Theorem 2 Let L <Z A* be a language recognized by a surjective homomorphism h : A* ^ M 
onto a finite monoid M . Then the following are equivalent: 

1. L is a Boolean combination of right (resp. left, two-sided) ideals. 

2. h{L) is a union ofTZ-classes (resp. C-classes, -classes). 

3. L satisfies the lattice identity z{xy)^x -f-)- z(xy)'^ (resp. the identity s{ts)'^ z o {ts)'^ z, 
the identity s{tsY z{xyY x {tsYz{xyY ). 

Proof. We show ([U <^ ([2]) and (0) <^ dS]) for right ideals. Left ideals and two-sided ideals 
are similar. For words u, w G A* we write u = v ii h{u) — h{v). 

© Let L Boolean combination of right ideals. Then L = [J"^i Pi \ Qi for right 

ideals Pi and Qi. To see this, we first use De Morgan's law in order to move negations inwards 
so that neither any intersection nor any union is negated. Then we perform an induction on 
the resulting positive Boolean expression. For right ideals and negations thereof the claim 
is trivially true. For union the induction step is also clear. Let now L — Li n L2 and let 
Li = \J^P^\ and L2 = U, Pj \ Q'r Then L = [^,^^iP^ n /^') \ {Q, U Q'^) and the claim 
follows since right ideals are closed under union and intersection. Consider u, v such that 
h{u) TZ h{v) and let x,y £ A* such that v = ux and u = vy. Suppose h{u) G h{L). Let 
Uj — u{xyy and Vj — UjX. Now, Uj = u, Vj = u, and Uj is a prefix of Vj which in turn is a 
prefix of Uj+i. Every Uj is in L and hence for every j G N there exists i £ {1, . . . , n} such 
that Uj G Pi\Qi- By the pigeonhole principle there exist j < k with Uj, Uk S Pi\Qi for some 
i G {1, . . . , n}. Then Vj G PiA* C P^. If Vj G Qi, then Uk G QiA* C Qj and Uk ^ Pi \ Qi, a 
contradiction. Thus Vj ^ Qi and vj £ Pi\Qi C L. Hence, h{v) — h{vj) G h{L). This shows 
that h{L) is a union of 7?.-classes. 

© Let R be an 7?.-class of M. Consider the two right ideals i?' = {a; | a; <ti R} 

and R" = {x \ x <tz R}. Then h-^{R) = h-^{R') \ h-^{R") is a Boolean combination of 
right ideals (since right ideals are closed under inverse homomorphisms). With h{L) being 
a finite union of 7?.-classes, the claim follows. 

© Suppose h{L) is a union of 7?.-classes. For every sufficiently large n > 1 we have 

h[z{xyy^) TZ h(^z{xyy^x) for all x,y,z £ A*. Thus z{xy)" £ L ^ z{xy)^x G L, showing the 
lattice identity. 

© =^ ©: Suppose h{w) TZ h{z) G h{L). Then there exist x,y <E A* such that z = wy 
and w = zx. We have z = z{xy)'^ for all n G N. Hence, z{xy)" G L. Choosing n sufficiently 
large, the lattice identity yields w = z{xy)^x G L and h{w) G h{L), showing that h{L) is a 
union of 7^-classes. □ 

Since Theorem [2] © can be verified effectively for the syntactic homomorphism, it is 
decidable whether a given regular language is a Boolean combination of right ideals (resp. 
left ideals, two-sided ideals). 
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Every 7?.-class is the set difference between two right ideals. Thus if L is a Boolean 
combination of (arbitrary) right ideals and if L is recognized hy h : A* M, then by 
Theorem [5J the language L can also be written as a Boolean combination of right ideals Ki 
such that each Ki is recognized by h. The situation for Boolean combinations of left ideals 
(resp. two-sided ideals) is similar. 

For finite monoids, J' is the smallest equivalence relation such that 72, C and L ^ J ^ see 
e.g. Proposition A. 2. 5 (2)]. Hence, it follows from Theorem [5] that a regular language L 
is a Boolean combination of two-sided ideals if and only if L is both a Boolean combination 
of right ideals and a Boolean combination of left ideals. 

In Boolean combinations of right ideals, intuitively speaking, what happens is that the 
end of words is "concealed." Appending a new symbol as an end-marker to a language yields 
a Boolean combination of right ideals. Specifically, if L is language over A \ {a}, then La 
is a Boolean combination of right ideals of A* because La — La A* \ LaA^. In Section O 
we will avoid this "revealing" of the end of the word by the right end marker by considering 
one-pass automata. 

4 One-way Automaton Models 

As usual, an automaton A = {Q,A,S,Qq,F) is given by a finite set of states Q, an input 
alphabet A, a transition relation 5 ^ Q x A x Q, a, set of initial states Qq C Q, and a set of 
final states F C Q. For transitions (p, a,q) G 6 we write p q and we inductively extend 
the transition relation to words: q ^ g for all q € Q; and p q if there exists some 
r Cz Q such that p r ^ q. A run on a word ai ■ ■ ■ an with G ^ is a sequence of states 
9091 ■ ■ ■ qn such that go G Qo and g^-i ^ qi for all i. We always assume that all states are 
accessible, i.e., for every q G Q there exist go G Qo and u G A* such that go g. A word 
u e A* is accepted by A if there exist p G Qo and q £ F such that p ^ q. The language 
recognized by A is L(A) = {u £ A* \ u is accepted by A}. The automaton A is complete if 
for every p £ Q and for every a & A there exists at least one state q £ Q such that p q\ 
it is trim if for every q Cz Q there exists w G A* and p E F such that q ^ p; and it is 
deterministic if |Qo| = 1 and for all p G Q and all a G A there is at most one state q £ Q 
with p q. 

In the remainder of the section, we give automaton models for regular right ideals, prefix- 
closed languages, factorial languages, and Boolean combinations of right ideals. The results 
concerning ideals and closed languages are straightforward and presented here only for the 
sake of completeness. Our main original contribution is Theorem [71 where we give three 
automaton descriptions of Boolean combinations of ideals: deterministic weak automata, 
deterministic Staiger- Wagner automata, and unions of deterministic flip automata. 

A flip automaton is an automaton such that p €z F and p q implies q £ F. The idea 
is that, in every run, flip automata can "flip" at most once from non-accepting to accepting. 
Note that the language of a complete flip automata remains unchanged if we add a self-loop 
g g for every state g G F and every letter a £ A. 

Proposition 3 Let L C_ A* be recognized by a complete deterministic automaton A. Then 
the following are equivalent: 

L L is a right ideal. 

2. A is a flip automaton. 

3. L is recognized by some complete (nondeterministic) flip automaton. 
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Proof, (d]) (HI): Let A = {Q, A, 6, qq, F) and suppose p q iov p e F and a G A. Since 
every state is reachable, there exists a word u € A* such that go P- In particular, u £ L. 
Since L is a right ideal, we have ua G L. Now, qq q yields q E F. The implication 
© © is trivial. 

® ^ Let A' = {Q, A, S, Qo, F) be a complete flip automaton recognizing L. Suppose 
u G L, and let a G A* he arbitrary. Then q^ ^ p for go G Qo a-nd p F. In addition, since 
is complete, we have p q. Moreover, q G F because A' is a flip automaton. This shows 
LACL and thus LA* C L. □ 

The equivalence of ([T]) and ^ in Proposition [3] is due to Paz and Peleg [TT] . Of course, 
not every complete nondeterministic automaton which recognizes a right ideal has to be a 
flip automaton. Note that arbitrary {i.e., non-complete and nondeterministic) flip automata 
can recognize all regular languages. 

A fully accepting automaton is an automaton in which all states are final, i.e., F — Q. 
The only possibility to reject a word is a missing outgoing transition at some point of the 
computation. Complementing Proposition [3] leads to the following characterization of fully 
accepting automata. 

Corollary 4 Let L Q A* be recognized by a deterministic trim automaton A. Then the 
following are equivalent: 

L L is prefix-closed. 

2. A is fully accepting. 

3. L is recognized by some (nondeterministic) fully accepting automaton. 

Proof. (HI) (HI): Let A — {Q,A,S,qo,F) and assume p G Q \ F. Since A is trim, there 
exist q G F and u,v G A* such that qo ^ p ^ q. Now, uv G L implies u G L and p G F, a. 
contradiction. 

The implication ([2]) => ([3]) is trivial. 

(El =^ (HI): Let A' — {Q, A,S,Qo, F) be a nondeterministic fully accepting automaton 
recognizing L. Suppose u ^ L, i.e., for every go G Qo and g G Q we have (go, u, g) ^ S. Thus 
for every v G A* and every go G Qo and q G Q we have (go, uv, g) ^ 5, i.e., we have uv ^ L. 
This shows that A* \ L is a right ideal. □ 

A path automaton is an automaton such that every state is both initial and final, i.e., 
Qq = F = Q. In particular, a path automaton accepts a word u G A* ii and only if there 
exists a path p ^ q for some p,q G Q. 

Corollary 5 Let L G A* be a regular language. Then L is factorial if and only if L is 
recognized by a path automaton. 

Proof. "=>": By Corollary S] (and since L is prefix-closed), the language L is recognized by a 
deterministic fully accepting automaton A — (Q, A, S, go, Q). We show L{Q, A, S, Q, Q) C L. 
Suppose p g for some v G A* . Then there exists u G A* such that go ^ p, i.e., uv G L. 
Since L is suffix-closed, we have v G L. 

Let A = {Q,A,6,Q,Q) be a path automaton with L{A) ~ L. Then A as well as 
the automaton obtained by reversing all edges (and interchanging initial and final states - 
which in this case has no effect) are fully accepting. Thus, by CoroUaryUJ both the language 
L and its reversal L"^ = {ai • ■ • a„ G A* | Oi G A, a„ • • • ai G L} are prefix-closed. It follows 
that L is factorial. □ 
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For deterministic transition relations, the statement of Corollary [S] can be found implicitly 
in the work of Avgustinovich and Frid |1J . 

An automaton is weak if for every strongly connected component C Q, we either have 
C C ForCnF = 0. The concept of weak automata has been introduced by MuUer, 
Saoudi, and Schupp |10| for alternating tree automata. A Staiger- Wagner automaton is 
given hy B = {Q,A,S,qo,T) where T C 2*5. Acceptance of a run by a Staiger- Wagner 
automaton only depends on the set of states visited by the run. A run qoqi ■ ■ ■ Qn is accepting 
if {qo, qi, . . . , q„} G T; and a word is accepted if it has an accepting run. 

Lemma 6 Let A = {Q, A,d,QQ, F) be a weak automaton. Then there exists T such that 
the Staiger- Wagner automaton B = (Q, A,d,QQ,l') recognizes L{A). 

Proof. Let future(g) denote the set of states which are reachable from q and which are not 
located in the same strongly connected component as q. We can construct T as follows: 

r=={T| BqeFnT: TCQ\ future(g)} . 

Each element of T guarantees, that a run ends within an accepting strongly connected 
component of A. Since A is weak, we conclude L{A) — L{B). □ 

Our next result shows that both deterministic weak automata and deterministic Staiger- 
Wagner automata are expressively complete for Boolean combinations of right ideals. More- 
over, if a deterministic automaton A recognizes a Boolean combination of right ideals, then, 
by Lemma [5J the automaton A itself can be equipped with a Staiger- Wagner acceptance 
condition. A third automaton model for Boolean combinations of right ideals is given by 
unions of (not necessarily complete) deterministic flip automata. This last property follows 
from Theorem [5] since the inverse homomorphic image of every 7^-class of a finite monoid is 
recognizable by a flip automaton. 

Theorem 7 Let L C_ A* be recognized by a deterministic automaton A. Then the following 
are equivalent: 

1. L is a Boolean combination of right ideals. 

2. A is weak. 

3. L is recognized by some deterministic Staiger- Wagner automaton. 

4. L is a finite disjoint union of languages L{Bi) such that each Bi is a deterministic flip 
automaton. 

Proof (H]) ^ ©: Let A ^ {Q, A, S, qo, F). Assume p ^ q and q ^ p ioT p e F. Choose 
z A* such that qo p. Then for all n G N we have z{xy)'^ £ L. By Theorem [51 the 
language L satisfies the lattice identity z{xy)'^ z{xy)'^x. Therefore, for some n, we have 
z{xy)"x G L. Now, S{qQ, z{xy)"x) = q implies q ^ F. 
The implication ([2]) => (O follows by Lemma El 

(El) =^ CJ: Let B — {Q,A,6,qQ,T) be a deterministic Staiger- Wagner automaton. We 
show that L{B) satisfies the lattice identity z{xy)'^ ^ z{xy)'^x. Let x,y,z G A* and let n > 
\Q\. Let go ^ 9i and qi qi+i for 1 < i < n. By choice of n there exist l<fc<^<n+l 
such that qk = qg. It follows that, for all to G N, the runs of the words z{xyY~'^{xy)™ and 
z{xyY~'^{xy)™x both visit the same states as z{xyY~^. In particular, z{xy)" G L if and 
only if z{xy)'^x G L (which proves the lattice identity z{xy)'^ o z{xy)'^x for L). 

© Let A = {Q, A, 6, qo, F) be weak. For a strongly connected component C C F 

we define Be = {Qc, A,6c,qo: F ^ C) as the (not necessarily complete) flip automaton 
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with states Qc = {9 G Q | C is reachable from q}^ and its transition function 5c is the 
restriction of 5 to states Qc- Then L{A) — IJ^L(Sc') where the union ranges over all 
strongly connected components C C F. Note that this union is indeed disjoint because A 
is deterministic. 

0=^(0): Every flip automaton is weak. Moreover, languages recognized by weak autom- 
ata are closed under union by the equivalence of ([T]) and ©. □ 

Both nondeterministic weak automata and nondeterministic Staiger- Wagner automata 
are expressively complete for the class of all regular languages as the next lemma shows. In 
particular, the nondeterministic variants of weak automata and Staiger- Wagner automata 
do not characterize Boolean combinations of right ideals. 

Lemma 8 Let L C_ A* . The following are equivalent: 

1. L is regular. 

2. L is recognized by a (nondeterministic) weak automaton. 

3. L is recognized by a (nondeterministic) Staiger- Wagner automaton. 

Proof. ([1]) ©: Let L be recognized by the deterministic automaton A — {Q,A,S,qo,F) 
and let / ^ Q be a new state. Let 

(5' = (5 U {{p, a, f) I (p, a, g) e (5 and q e F} . 

We set Qo ~ {90, /} if 90 G and Qo = {<Zo} otherwise. This way we introduce a single 
accepting state / which can be reached nondeterministically if and only if there was a path 
from the initial state to some final state in A. Thus (Q, A, 5' , Qq, {/}) recognizes L. 
(HD =^ ©: follows from Lemma O 

© ©: Let B = {Q, A,d,Qo,T) be a nondeterministic Staiger- Wagner automaton. 
We can construct A = {2'^ x Q, A,S' ,Q'o, F) with {{P,q),a,{P' ,q')) £ S' if and only if 
(g, a,q') e 5 A P' = P U {q'}. The set of initial states is Q'q = {{{q} ,q) \ q e Qo}, and the 
set of final states is defined by {P, q) ^ F ii and only if P G T. This way, A simulates B 
along each path and collects the visited states. It accepts, if the set of visited states is in T- 
Theieiore L{A) = L{B). □ 

Remark 9 Proposition ( resp. Corollary Theorem \^ yields another decision procedure 
for the class of regular right ideals (resp. prefix-closed languages, Boolean combinations of 
right ideals). In the case of Proposition \^ this was first observed by Paz and Peleg fll}/ . 
Moreover, the above decidability results can often be combined with other automaton models. 
For example, a well-known result of McNaughton and Papert says that a language is definable 
in first- order logic if and only if its minimal automaton is counter-free Together with 
Theorem\^ we see that a language L is a first- order definable Boolean combination of right 
ideals if and only if the minimal automaton of L is weak and counter-free. O 

5 Two-way Automaton Models and Languages in VA 

The results in the previous section can easily be translated into characterizations of regular 
left ideals (resp. sufHx-closed languages. Boolean combinations of left ideals) by considering 
automata which read the input from right to left. Varying the direction of the head movement 
naturally leads to two-way automata. The situation for arbitrary two-way automata is more 



9 



involved than for one-way automata; the main reason is that two-way automata are usually 
defined using left and right end markers. On the other hand, if L C {A \ {a})*, then 
La = La A* \ LaA~^. This shows that by adding an explicit end marker, every language 
becomes a Boolean combination of right ideals. To overcome this, we introduce the notion 
of one-pass two-way automata; these automata stop processing the input as soon as they 
read the right end marker. Now, the problem with classes of one-pass two-way automata is 
that, in general, they may not be closed under union and intersection (standard techniques, 
such as executing one automaton after the other, cannot be applied). We have no satisfactory 
solution for arbitrary two-way automata, but we show that the concepts of Section |4] can 
be adapted to a well-known subclass of two-way automata, namely deterministic partially 
ordered two-way automata (po2dfa). The class of languages recognized by po2dfa is a natural 
subclass of the star- free languages which has a huge number of different characterizations, see 
e.g. [TTIH]. The most prominent of these characterizations is definability in two- variable first- 
order logic. By a description of algebraic means, it is the language variety VA, i.e., the class 
of regular languages satisfying the lattice identity p{xy)^ q-n-p{xy)^ x{xy)'^ q. As a byproduct, 
we show that some of the other characterizations of po2dfa recognizable languages also admit 
natural counterparts for right ideals and their Boolean combinations. 

A two-way automaton is a tuple A = {Z, A, 6, Xq, F). The finite set of states Z — XUY is 
partitioned into right-moving states X (for neXt) and left-moving states Y (for Yesterday). 
The states in Xq C X are initial and states in F C Z are final. On input u G A* , the tape 
content is i>m<i where > and < are new symbols marking the left and right end of the tape, 
respectively. Initially, the head is at the first letter of u. The direction in which the input 
is processed can be controlled by A. The idea is that before a transition is made, the head 
movement is performed, and the direction of the movement depends only on the destination 
state of the transition. The left end marker > must not be overrun. More formally, the 
transition relation satisfies 6 C (Z x A x Z) U (Y x {>} x X) U {X x {<} x Z). As for 
one-way automata, we write z z' instead of (z, a, z') G 5. More formally, a configuration 
is a pair {z,i) G Z x N where z is the current state and i is the current position on the tape. 
Suppose position i is labeled by a G yl U {>, <i}. Then a transition {z,i) {z',j) between 
configurations exists ii z z' and j — i -\- 1 (for z' G X) or j = i — 1 (for z' G Y). A 
computation of A on input u is a sequence 

{zaJo) \-A ■■■ \-A {zt,it) 

of configurations such that zq G Xq, zq = 1, i/t G {0, . . . , |w| -f- 1} for 1 < k < t, and 
it — |u| -I- 2. Note that position is labeled with the left end marker i> and the position 
|w| -|- 1 is labeled with the right end marker <. The computation is accepting ii zt & F is final 
and the input u is accepted if there exists an accepting computation for it. Note that by the 
signature of the transition relation, the left end marker [> cannot be trespassed. One-way 
automata may be seen as special cases with Y = %. The language L{A) recognized by A is 
L(A) = {u & A* \ A accepts u}. 

A two-way automaton is deterministic if \Xq\ — 1 and for all z G Z and all a G A U {>, <} 
there exists at most one z' E Z with z z'. For technical reasons, we also consider the 
empty automaton {Z — S — Xq = F = 0) as deterministic. It is complete if for all z G Z and 
all a there exists z' G Z with z z' (more precisely, we require the existence of z' if either 
z G Y and a G AU {o} or if z G AT and a G AU {<}). A two-way automaton is one-pass if 
z ^ z' implies z — z' . The idea is that a two-way automaton has finished "one pass" when 
it encounters the right end marker <i for the first time; hence for a one-pass automaton, the 
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acceptance of a word is determined by the state when scanning <i for the first time. The 
automaton is partially ordered if there exists a partial ordering C of the states such that 
transitions are non-descending, i.e., if z z', then z C z'. In other words, once a state is 
left in a partially ordered automaton, it is never re-entered. We abbreviate "deterministic 
partially ordered two-way automaton" by po2dfa. 

Schwentick, Therien, and Vollmer |14j have shown that po2dfa are expressively complete 
for XM.. The main result of this section is a characterization of Boolean combinations of right 
ideals (resp. right ideals, prefix-closed languages) in VA in terms of subclasses of one-pass 
po2dfa. 

A crucial property of one-pass po2dfa is closure under Boolean combinations; and to see 
this, we shall need the following synchronization lemma. The same lemma was formulated 
already in Lemma 8] for Biichi automata and infinite words. The alphabet of a word u is 
denoted by alph(it). The ith letter of u is u{i). 

Lemma 10 (Synchronization Lemma) Consider a po2dfa A with states Z = X (j Y . 
For every v — ai ■ ■ ■ a.,„ G r+ there exists a po2dfa C with states Zc — Z y. {u} x {1, . . . , to} 
such that, for all u G F* having a factorization u = uiOi ■ ■ ■ UmO-mu' with Oi ^ alph(ui), the 
following simulation property holds: If 

{za,io) "ta {zi,ii) ••• (2„,i„) 

is a sequence of transitions of A for some n>l with io — in — \uiai ■ ■ ■ UmCiml o-nd it < in 
for all 1 < t < n, then 

((zi,u, fci),ii) he ■•■ he ((z„,w,fc„),i„) 

is a sequence of transitions of C with ki — kn — m such that there exists no 1 < t < n with 
Zt G X, kt = TO, and u{it) = flm. □ 

Intuitively, this means that if a deterministic po2-automaton moves left at some point in 
its computation, then it may recognize the position on the input on-the-fly — provided that 
this happens at a suitable position, i.e., the am in the factorization stipulated in Lemma [TOl 
In the latter application, determinism will yield such a factorization and for a partially 
ordered automaton the parameter m can be bounded over all inputs u G A*. Note that [51 
Lemma 8] was formulated with Biichi automata on infinite words. However, the acceptance 
condition does not influence the statement at all and, since the computations considered 
in the lemma take place completely on the finite prefix uiai ■ ■ ■ UmO-m, the behavior of the 
automata is independent of the suffix u' which may even be an infinite word. 

Lemma 11 The class of languages recognizable by one-pass po2dfa is a Boolean algebra. 

Proof. Suppose that A ~ (X U Y,A,6,xq,F^ is a one-pass po2dfa. By adjoining a new 
non-final right-moving sink state, we may assume that A is complete. Then Al = {X U 
Y, A,8,xq,X \ F) recognizes A* \ L{A). Therefore, one-pass po2dfa are closed under com- 
plementation. It remains to show closure under union. 

We describe a product automaton construction for the union of two automaton which 
executes both automata in parallel. Of course, there is only one head to do this, and the 
main problem to overcome in this construction is when the automata disagree on the head 
movement. We shall only give a high-level description of the construction; details can be 
implemented similarly to the situation for deterministic po2-Biichi automata |8j. 
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By adding a new sink state as needed, we may assume that both automata are complete. 
The two automata are simulated in parallel as long as they agree on moving to the right. 
This is called the synchronous mode. If at least one of the automata changes to left-moving, 
then we start a simulation of one of the left-moving automata in the so-called asynchronous 
mode while suspending the other automaton. We refer to the position of the input where 
this divergence happens as the synchronization point. In asynchronous mode, the active 
automaton can move in either direction. As soon as the synchronization point is reached 
again and both automata agree on moving to the right, we switch back to synchronous mode 
and continue simulating both automata in parallel; otherwise we stay in asynchronous mode 
while simulating one of the automata. To implement this idea, the synchronization point 
must be recognized while in the asynchronous mode. 

For this re-synchronization, we use Lemma [TO] and some combinatorial property of com- 
putations of po2dfa. Assume that we are about to enter the asynchronous mode. Suppose 
the input u is factorized u = uiai ■ ■ ■ Uma-mu' such that the a^'s correspond to the positions 
where during synchronous mode at least one of the automata changed its state. Note that 
Om corresponds to the synchronization point because a change from right-moving to left 
moving implies a change of state. By determinism, we have Oi ^ alph(ui). Moreover, since 
both automata are partially ordered, m is bounded by the sum of the number of states of 
both automata. The last observation allows to store the word v = ai ■ ■ ■ Om in a bounded 
stack of letters in the state space. Using the automaton from Lemma [TO] for v as the ac- 
tive automaton, we can simulate the active automaton whilst being aware of whenever the 
synchronization point is reached again. Both automata are complete and thus the synchro- 
nization point is eventually reached by the active automaton. After this, we switch back to 
synchronous mode to simulate both automata in parallel. In synchronous mode the stack of 
letters is administered, i.e., whenever a state change happens in one of the automata whilst 
in synchronous mode, the currently scanned letter is pushed to the stack. At the end, we 
accept if one of the automata accepts. 

The procedure given above can be done effectively in such a way that the simulating 
automaton is a complete, deterministic one-pass po2-automata. The actual construction is 
along the lines of the proof of [8 , Proposition 9] and therefore not given here. □ 

A monomial is a language P = AJai • • • A^a^A^^j^ where Ai ^ A and a.; G A. It is 
unambiguous if every word u £ P has a unique factorization u = UiOi ■ ■ ■ UkOkUk+i with 
Ui A* . A convenient intermediate step from languages in VA to automata are rankers. 
A ranker is a word in {Xq, | a £ A}*. Intuitively, a ranker r represents a sequence of 
instructions for "next a-position" and for "previous a-position" which is processed from 
left to right. That is, for a word u — ai ■ ■ ■ On with aj G A and a position i G {0, . . . , n -I- 1} 
we set e{u, i) = i and 



If a nonempty ranker r starts with an Xa-modality, then we say that r is an X-ranker; and we 
define r{u) — r{u, 0), i.e., the evaluation of X-rankers starts at the beginning of the word u. 
Symmetrically, if r starts with Ya, then r{u) = r{u,n + 1). As usual, min0 and max0 are 
undefined. Thus a nonempty ranker r either defines a unique position r{u) in a word u or 
r{u) is undefined. For example, XaYbXc(6ac) — 3 whereas XaYbXc(c6a) is undefined. For a 
ranker r we set L{r) = {u Cz A* \ r{u) is defined}. 



Xar{u,i) 
yar{u,i) 



r{u,miii{j > i \ Oj — a}), 
r{u,max{j < i \ aj — a}). 
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Theorem 12 Let L <Z A* . The following are equivalent: 

1. L Cz 'DA{A*) is a Boolean combination of right ideals. 

2. L is a finite union of unambiguous monomials A^ai ■ ■ ■ A^.a^A'^^^ 
with {ui, . ■ ■ , ttk} ^ Ai for all i € {1, . . . ,k}. 

3. L is Boolean combination of languages L{r) forX-rankers r. 
4-. L is recognized by a one-pass po2dfa. 

Proof. Before turning to the actual proof, we give a rough overview of the techniques 
employed. Right ideals are the finitary version of open sets in the Cantor topology over 
infinite words. It is therefore not surprising that a large part of Theorem [TH reduces to 
infinite words: The proof of the implication from (IT|) to Q relies on a result of Diekert and 
Kufleitner [5, Theorem 6.6]. The step from © to Q uses a characterization of X-rankers 
over infinite words O Theorem 3]. Showing the implication from ([3]) to Q is the most 
technical part. In particular, one has to show that one-pass po2dfa are closed under union 
and intersection. Here, the respective result for po2-Buchi automata cannot be applied 
directly, but showing closure under union and intersection resembles techniques which were 
developed for deterministic po2-Biichi automata [B]. This is Lemma [TT] Finally, the step 
from back to ([1]) easily follows by combining the characterization of po2dfa due to 
Schwentick, Therien, and VoUmer [TU Theorem 3.1] with Theoremj^l We need to introduce 
some more notation for the proof. 

A monomial A^ai ■ ■ ■ A^afcA^_|_-^ is restricted if there exists no i e {1, . . . , fc} such that 
{oi, . . . ,ak} C Ai. Let DA be the class of finite monoids satisfying the identity (xi/)" = 
{xy)^ x{xy)^ . A language L is contained in DA if and only if it is recognized by a homomor- 
phism h : A* M to a finite monoid in DA. The set of finite and infinite words over A is 
A°° . The w-iteration of a language L C A* of finite words is — {wiU2 • ■ • € A°° \ Ui G L}; 
in particular we stipulate the convention = e. Note that it will always be clear from 
the context whether by "w" we mean an infinite product or a generated idempotent. Let 
ft, : A* — > M be a homomorphism to a finite monoid M. For x £ M let [x] — h~^{x). A 
language K C A°° of finite and infinite words is recognized by h if 

K = [j {[s][e]" I [s][e]" n L ^ and s ^ se, ^ e] . 

Note that [1]" also contains finite words. The evaluation of an X-ranker r extends naturally 
to infinite words and the L{r) over A°° consists of all finite or infinite words on which r is 
defined. For more details we refer to [3]. 

© =^ @ • By Theorem [5] the language L C A* is recognized by some h : A* ^ M £ DA 
such that h{L) is a union of 7?.-classes. Consider the language 

K = [j {[s][e]" I [s] C L and s = se, = e} 

of finite and infinite words. We have n A* = L and K is recognized by h. Consider 
s,t,e,f e M such that s = se, t = tf, e^ = e and — f. Then because h{L) is a 
union of 7^-classes, s TZ t implies [s][e]" C K if and only if C K. The language K 

is a finite union of restricted unambiguous monomials A^ai • • • A^.afcA^-^ over A°°, see O 
Theorem 6.6]. Therefore, L — K nA* is a finite union of restricted unambiguous monomials 
A>i • • • A*.afeA*._^^ over A*. 

© =^ Q: Let L be a finite union of restricted unambiguous monomials of the form 
A*ai • ■ • A^flfcA^^-^. Let K C A°° be obtained by replacing these monomials by the mono- 
mial A*ai • ■ • A^flfcA^j^. Then K is a union of restricted unambiguous monomials over A°° . 
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Now, K is definable over in the first-order fragment A2[<], see [5, Theorem 6.6]. Thus K 
is a Boolean combination of languages L{r) over A°° for X-rankers r, see [21 Theorem 3]. It 
follows that L = KnA* is a Boolean combination of languages L{r) over A* for X-rankers r. 

® =^ It is easy to see that every language L{r) for an X-ranker r is recognizable 
by a one-pass po2dfa. With Lemma [TTl we get closure of one-pass po2dfa under Boolean 
operations. 

O ^ d): Let L be recognized by a complete one-pass po2dfa A. In particular, ^ is a 
po2dfa and thus L € VA, cf. [HI Theorem 3.1]. Let n be a number greater than the number 
of states of A and let x,y,z € A*. We claim that z{xy)'^ S L if and only if z{xy)"'x € L: 
Consider the run of A on either word. Let q be the state in which A leaves the prefix z{xy)"' 
for the first time. Note that this must happen eventually since A is complete and the left 
end marker > cannot be trespassed. Then q is right-moving and q q is a, loop for all letters 
in a G alph(a;?/) by choice of n. Hence, A encounters the right end marker < in the state q 
on both inputs z(xy)"' and z{xy)"'x. Therefore, z{xy)"' is accepted if and only if z{xy)"'x is 
accepted. By Theorem [5J the language L is a Boolean combination of right ideals. □ 

It is decidable whether a given regular language belongs to T>A. Therefore, using Propo- 
sition [1] and Theorem ^ it is decidable whether a regular language is recognized by an 
arbitrary (resp. flip, fully final) one-pass po2dfa. The temporal logic version of X-rankers is 
denoted TLx [Xq , Yq] , cf. ^ ; it is a fragment of deterministic unary temporal logic TL[Xa , Xj] 
over the modalities Xa and Yq. The logic TL[Xa,Ya] is expressively complete for VA, and 
TLx[Xa; Ya] defines the right ideals in VA. 

Remark 13 We use the shortcut "nfa" /or nondeterministic finite automaton, and "pol" 
for partially ordered one-way. Using this notation, we have the following inclusions between 
language classes recognizable by partially ordered automata: 

poldfa C one-pass po2dfa C po2dfa C po2nfa = polnfa. 

The following (very similar) languages show that the inclusions are strict. The language 
{a, c}* ah {a, 6, c}* is recognizable by some one-pass po2dfa but not by a poldfa. The language 
{a,b,c}* ab {b,c\* is recognizable by a po2dfa but not by any one-pass po2dfa. Finally, the 
language {a,b,c}* ab {a,b,c}* is recognizable by some polnfa but not by any po2dfa. The 
equivalence of po2nfa and polnfa is due to Schwentick, Therien, and Vollmer For 
each of the above language classes the membership problem is decidable: The class poldfa 
corresponds to TZ-trivial monoids \14i , one-pass po2dfa correspond to TZ-classes of monoids 
in DA (Theorem \^ and Theorem The algebraic equivalent o/ po2dfa is the variety 

of finite monoids DA JT^, and po2nfa are expressively complete for the level 3/2 of the 
Straubing- Therien hierarchy which is decidable by a result of Pin and Weil 1 131. O 

In analogy to Theorem [T^l there is also an expressively complete two-way automaton 
model for Boolean combinations of right ideals. A two-way automaton is weak if for every 
strongly connected component either all states are final or all states are non-final. Note that 
every partially ordered automaton is weak. The following result is our only general result 
for arbitrary (not partially ordered) deterministic two-way automata. 

Proposition 14 A regular language is a Boolean combination of right ideals if and only if 
it is recognized by a deterministic weak one-pass two-way automaton. 
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Proof. If L is a Boolean combination of right ideals, then L is recognized by a deterministic 
weak (one-way) automaton by Theorem [71 Note that every one-way automaton can also be 
seen as a two-way automaton without left-moving states. 

For the converse, consider a complete deterministic weak one-pass two-way automaton A. 
By Thcorem[2]it suffices to show that L{A) satisfies the lattice identity z{xy)'^ <-> z{xy)'^x. 
The leaving state of u is the state of A which on input u encounters the right end marker <i for 
the first time. Note that, since A is complete and deterministic, there is a unique state with 
this property. Consider words x,y^ z ^ A* . There are only finitely many strongly connected 
components of A. Consequently the pigeonhole principle yields an integer n such that the 
leaving states of z{xy)"' and of are in the same strongly connected component. 

Hence, the same is true for the leaving states p of zixy)^ and q of z{xy)^x. Since A is weak, 
p is final if and only if q is; and because .A is a one-pass automaton we have z{xy)"' G L{A) 
if and only if z{xy)"'x E L{A). This establishes the lattice identity. □ 

Not every deterministic one-pass two-way automaton recognizing a Boolean combination 
of right ideals needs to be weak. Therefore, the equivalence of ^ and Q in Theorem [T^ 
does not follow from Proposition [TH Also note that the analogue of Proposition [U does 
not work for right ideals (resp. prefix-closed languages) and deterministic flip (resp. fully 
accepting) one-pass two-way automata since deterministic two-way automata can also reject 
an input by an infinite cycle in its computation. 

As for one-way automata in Section |4l we get right ideals in VA if the recognizing autom- 
aton is a flip automaton. For a flip automaton, a transition z z' with flnal state z implies 
that z' is final. As an intermediate step, we get a characterization in terms of unambiguous 
monomials. 

Theorem 15 Let L C A* . The following are equivalent: 

1. L G 'DA{A*) is a right ideal. 

2. L is a finite union of unambiguous monomials A^ai ■ ■ ■ A'^akA* . 

3. L is recognized by a complete flip one-pass po2dfa. 

Proof. We first show O ^ dS]). Suppose L G VA{A*) is a right ideal of A*. By Theorem [12] 
there exists a complete one-pass po2dfa A which recognizes L. We show how to obtain an 
equivalent automaton B which is a flip automaton. Let us say that, during a computation, 
a deterministic automaton is in progress mode if after the next transition is taken, the 
automaton scans a position which has not been scanned before. The idea is that we need to 
change into a flnal state only when in progress mode. Note that for acceptance, the crucial 
transition of a (one-pass) two-way automaton is always made in progress mode. Moreover, 
consider an input uav and suppose A scans position \ua\ in progress mode and performs 
a transition into a final state. Then the prefix ua is accepted because .A is a one-pass 
automaton (note that this would not hold if A has already seen some prefix of v during the 
computation). Since L is a right ideal, all words in uaA* are accepted. This shows that 
if in progress mode a transition into a flnal state is made, then we can directly go into a 
flnal, right-moving sink state without changing the language. In total this yields a complete 
one-pass po2dfa which is a flip automaton. It remains to show that we can simulate A in 
such a way that the simulation is aware of when it is in progress mode. 

Assume that A is leaving progress mode. This can only happen by a transition to a 
left-moving state. Suppose the input is factorized as m = uiOi ■ ■ -Umamu' where the a^'s 
correspond to the positions where a state change happened while in progress mode. Note 
that am corresponds to the position scanned before taking the transition because a state 
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change is necessary to leave progress mode. Now since A is deterministic, we see ^ alph(u.i) 
for all i. Moreover since A is partially ordered, m is bounded by the number of states of A. 
Therefore, the simulation can store the word v — ai ■ ■ ■ am in a stack of letters with bounded 
depth in its state space. Using the automaton C of Lemma [TU] with v = ai ■ ■ ■ a™, we can 
simulate A in the subsequent non-progressing phase and recognize when we are scanning 
the frontier am of progress again. The automaton is complete and thus there eventually 
is a transition trespassing the position corresponding to This is when the simulation 
switches back to progress mode. Back in progress mode, the simulation organizes the stack 
by pushing the currently scanned letter to the stack if it causes a state change. 

([3]) (O: Let L = L{A) for a complete one-pass po2dfa which is a flip automaton. For 
every u £ L{A) we construct an unambiguous monomial P{u) — A^ai ■ ■ ■ A'^akA* such that 
u £ P{u) C L{A) and k bounded by the number of states of A. Since there are only finitely 
many such monomials, we have L(A) — Uugl ^l"") ^^"^ ^-'^^^ union is finite. 

To construct P(u) consider u £ L{A) and fix an accepting computation of A on u. Con- 
sider the factorization u = uiai ■ ■ ■ u^aku' where the a^'s correspond to state changes. Let 
P{u) — A\ai ■ ■ ■ A*f.akA* with Ai — alph(ui). Trivially, u e P{u) and k is bounded by 
the number of states of A. Moreover, P{u) is unambiguous because A is deterministic. It 
remains to show P{u) C L{A). Suppose A is in some state z while scanning a 6-position 
of Ui. By construction there is no state change with the next transition, i.e., there is a 
loop z z in A. Consider some word v £ ^'(w) and factorize v = viUi ■ ■ ■ VkUkv' with 
Vi & A* . By construction, there exists a run of ^ on u which eventually trespasses ak into a 
final right-moving state. Then since A is a complete flip automaton, no matter what comes 
beyond the Uk can remedy acceptance. This shows that every v G P(u) is accepted. 

© =^ Every union of unambiguous monomials is in VA, cf. |17lH]. By Proposition [TJ 
we see that L is a right ideal. □ 

Note that property ^ in Theorem states that unambiguity of monomials and the 
ideal property can be achieved simultaneously, which is non-trivial. A two-way automaton 
is fully accepting if all its states are final. As for one-way automata, this yields prefix-closed 
languages (at least for VA) . The following result for prefix-closed languages is an immediate 
corollary of Theorem \T5[ 

Corollary 16 Let L C A* . The following are equivalent: 

1. L £ VAiA*) is prefix-closed. 

2. L is recognized by a fully accepting one-pass po2dfa. □ 

Proof (HI ^ (E]): Let L e VAiA*) be prefix-closed. The complement A* \ L is a right 
ideal in VAiA*) and thus Theorem [T51 vields a complete one-pass po2dfa A — {Z, A, 6, xq, F) 
which is flip and recognizes A* \ L. We can assume xq ^ F since otherwise e £ A* \ L 
and thus A* \ L ~ A* and L — % (and for L = we allow the empty automaton). Let 
A! — {Z \ F, A, 5' , xq, Z \ F) be the deterministic one-pass po2-automaton obtained from A 
by restricting the states to Z \ F, i.e., the transition relation 6' is given hy z z' in A' if 
z, z' € Z\F and z z' in A. Clearly, A' is fully accepting and a straightforward verification 
yields L{A') ^ A* \L{A). 

(0) =^ Suppose L = L{A) for a fully accepting one-pass po2dfa A — {Z, A, 6,xo, Z). 
Let A' = {Z {xf} , A, S' ,xq, {xf}) where Xf is a new right-moving sink state, i.e., S' 
extents d with transitions z Xf for z G Z U {xf} if there exists no z' G Z such that 
{z, a, z') € 6. Then A' is a complete one-pass flip po2dfa and L{A') is a right ideal in VAiA*) 
by Theorem [ini Since L{A) = A* \ L{A'), we see that L e VAiA*) is preflx-closed. □ 
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